Considerations in Dynamic Time Warping Algorithms for Discrete Word Recognition
Abstract
The technique of dynamic time warping for time registration of a reference and test utterance has found widespread use in the areas of speaker verification and discrete word recognition. As originally proposed, the algorithm placed strong constraints on the possible set of dynamic paths: namely, it was assumed that the initial and final frames of both the test and reference utterances were in exact time synchrony. Because of inherent practical difficulties in satisfying the assumptions under which these constraints are valid, we have considered some modifications to the dynamic time warping algorithm. In particular, an algorithm in which uncertainty exists in the registration of both the initial and final frames was studied. Another modification constrains the dynamic path to follow (within a given range) the path that is locally optimum at each frame. This modification tends to work well when the location of the final frame of the test utterance is significantly in error because of, for example, breath noise. To test the different time warping algorithms, a set of ten isolated words spoken by 100 speakers was used. Probability density functions of the distances from each of the 100 versions of a word to a reference version of the word were estimated for each of three dynamic warping algorithms. From these data, it is shown that, based on a set of assumptions about the distributions of the distances, the warping algorithm that minimizes the overall probability of making a word error is the modified time warping algorithm with unconstrained endpoints. A discussion of this key result, along with some ideas on where the other modifications would be most useful, is included.
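The difference between constrained and relaxed endpoints can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes scalar features, an absolute-difference local distance, and the simplest three-way step pattern rather than the slope constraints used in the speech literature. The function name `dtw_distance` and the parameters `start_slack` and `end_slack` are hypothetical, introduced only for this illustration.

```python
import math


def dtw_distance(test, ref, start_slack=0, end_slack=0):
    """Accumulated DTW distance between two sequences of scalar features.

    start_slack and end_slack are hypothetical parameters: the number of
    frames by which the warping path may begin or end away from the corner
    cells. With both set to 0 the path is pinned to the first and last
    frames of both sequences (the constrained-endpoint case).
    """
    n, m = len(test), len(ref)
    acc = [[math.inf] * m for _ in range(n)]

    for i in range(n):
        for j in range(m):
            local = abs(test[i] - ref[j])  # assumed local distance

            # Cells on the first row or column, within the slack, may start a path.
            if (i == 0 and j <= start_slack) or (j == 0 and i <= start_slack):
                best_prev = 0.0
            else:
                best_prev = math.inf

            # Simple step pattern: vertical, horizontal, or diagonal predecessor.
            if i > 0:
                best_prev = min(best_prev, acc[i - 1][j])
            if j > 0:
                best_prev = min(best_prev, acc[i][j - 1])
            if i > 0 and j > 0:
                best_prev = min(best_prev, acc[i - 1][j - 1])

            acc[i][j] = local + best_prev

    # Cells on the last row or column, within the slack, may end a path.
    ends = [acc[n - 1][j] for j in range(max(0, m - 1 - end_slack), m)]
    ends += [acc[i][m - 1] for i in range(max(0, n - 1 - end_slack), n)]
    return min(ends)


# A test utterance with one spurious trailing frame (e.g. breath noise).
reference = [1, 2, 3]
noisy_test = [1, 2, 3, 9]

constrained = dtw_distance(noisy_test, reference)
relaxed = dtw_distance(noisy_test, reference, end_slack=1)
```

In this toy case the constrained path is forced to align the spurious final frame with the last reference frame, while the relaxed path is allowed to stop one frame early. The sketch does not normalize by path length, which a practical comparison across endpoint choices would likely need.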
Similar resources
Speaker-independent word recognition by less cost and stochastic dynamic time warping method
In this paper, we describe some considerations on a speaker-independent word recognition method on a large vocabulary size by the concatenation of syllable templates and a stochastic dynamic time warping method, where syllable templates are taken from spoken words. We got the reference patterns from 216 words uttered by 30 male speakers and recognized the other 200 words uttered by the other 10...
Robot Arm Performing Writing through Speech Recognition Using Dynamic Time Warping Algorithm
This paper aims to develop a writing robot by recognizing the speech signal from the user. The robot arm constructed mainly for the disabled people who can’t perform writing on their own. Here, dynamic time warping (DTW) algorithm is used to recognize the speech signal from the user. The action performed by the robot arm in the environment is done by reducing the redundancy which frequently fac...
An investigation of the use of dynamic time warping for word spotting and connected speech recognition
Several variations on algorithms for dynamic time warping have been proposed for speech processing applications. In this paper two general algorithms that have been proposed for word spotting and connected word recognition are studied. These algorithms are called the fixed range method and the local minimum method. The characteristics and properties of these algorithms are discussed. It is show...
Speech Recognisation System Using Wavelet Transform
To develop speech recognition system with low word error rate using wavelet transform through pattern recognition approach. The aim of this paper is to make intelligent system that can recognize the speech signal. This includes also how the feature extracted from the speech signal using Discrete Wavelet Transform and then Dynamic Time Warping is used for pattern matching from the stored databas...
Performance Tradeoffs in Dynamic Time Warping Algorithms for Isolated Word Recognition
The technique of dynamic programming for the time registration of a reference and a test pattern has found widespread use in the area of isolated word recognition. Recently, a number of variations on the basic time warping algorithm have been proposed by Sakoe and Chiba, and Rabiner, Rosenberg, and Levinson. These algorithms all assume that the test input is the time pattern of a feature vector...
Publication date: 2002